FORBIDDEN GAP ARGUMENT FOR PHASE TRANSITIONS 
PROVED BY MEANS OF CHESSBOARD ESTIMATES 



MAREK BISKUP 1 AND ROMAN KOTECKY 2 

1 Department of Mathematics, UCLA, Los Angeles, California, USA 
2 Center for Theoretical Study, Charles University, Prague, Czech Republic 

Abstract: Chessboard estimates are one of the standard tools for proving phase coexistence in 
spin systems of physical interest. In this note we show that the method not only produces a point 
in the phase diagram where more than one Gibbs states coexist, but that it can also be used to 
rule out the existence of shift-ergodic states that differ significantly from those proved to exist. For 
models depending on a parameter (say, the temperature), this shows that the values of the conjugate 
thermodynamic quantity (the energy) inside the "transitional gap" are forbidden in all shift-ergodic 
Gibbs states. We point out several models where our result provides useful additional information 
concerning the set of possible thermodynamic equilibria. 



1. Introduction 

One of the basic tasks of mathematical statistical mechanics is to find a rigorous approach to 
various first-order phase transitions in lattice spin systems. Here two methods of proof are 
generally available: Pirogov-Sinai theory and chessboard estimates. The former, developed in 
[30, 31], possesses an indisputable advantage of robustness with respect to (general) perturba- 
tions, but its drawbacks are the restrictions — not entirely without hope of being eventually elimi- 
nated [22, 23, 15, 35, 7] — to (effectively) finite sets of possible spin values and to situations with 
rapidly decaying correlations. The latter method, which goes back to [20, 18, 19], is limited, for 
the most part, to systems with nearest-neighbor interactions but it poses almost no limitations on 
the individual spin space and/or the rate of correlation decay; see e.g. [29]. 

While both techniques ultimately produce a proof of phase coexistence, Pirogov-Sinai theory 
offers significantly better control of the number of possible Gibbs states. Indeed, one can prove 
the so called completeness of phase diagram [34, 8] which asserts that the states constructed by 
the theory exhaust the set of all shift-ergodic Gibbs states. (In technical terms, there is a one- 
to-one correspondence between the shift-ergodic Gibbs states and the "stable phases" defined in 
terms of minimal "metastable free energy".) Unfortunately, no conclusion of this kind is currently 
available in the approaches based solely on chessboard estimates. This makes many of the con- 
clusions of this technique — see [12, 33, 3, 5, 17] for a modest sample of recent references — seem 
to be somewhat "incomplete." 
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To make the distinction more explicit, let us consider the example of temperature-driven first- 
order phase transition in the g-state Potts model with q » 1. In dimensions d > 2, there exists a 
transition temperature, T t , at which there are q ordered states that are low on both entropy and en- 
ergy, and one disordered state which is abundant in both quantities. The transition is accompanied 
by a massive jump in the energy density (as a function of temperature). Here the "standard" proof 
based on chessboard estimates [25, 26] produces "only" the existence of a temperature where the 
aforementioned q + 1 states coexist, but it does not rule out the existence of other states; particu- 
larly, those with energies "inside" the jump. On the other hand, Pirogov-Sinai approaches [24, 27] 
permit us to conclude that no other than the above q + 1 shift-ergodic Gibbs states can exist at T t 
and, in particular, there is a. forbidden gap of energy densities where no shift ergodic Gibbs states 
are allowed to enter. 

The purpose of this note is to show that, after all, chessboard estimates can also be supple- 
mented with a corresponding "forbidden-gap" argument. Explicitly, we will show that the calcu- 
lations (and the assumptions) used, e.g., in [25, 12, 33, 3, 5, 17] to prove the existence of particular 
Gibbs states at the corresponding transition temperature, or other driving parameter, imply also 
the absence of Gibbs states that differ significantly from those proved to exist. We emphasize that 
no statement about the number of possible extremal, translation-invariant Gibbs states is being 
made here, i.e., the completeness of phase diagram in its full extent remains unproved. Notwith- 
standing, our results go some way towards a proof of completeness by ruling out, on general 
grounds, all but a "small neighborhood" of the few desired states (which may themselves be a 
non- trivial convex combination of extremal states). 

The assumptions we make are quite modest; indeed, apart from the necessary condition of re- 
flection positivity we require only translation invariance and absolute summability of interactions. 
And, of course, the validity — uniformly in the parameter driving the transition — of a bound that 
is generally used to suppress the contours while proving the existence of coexisting phases. We 
also remark that the conclusion about the "forbidden gap" should not be interpreted too literally. 
Indeed, there are systems (e.g., the Potts model in an external field) where more than one gap may 
"open up" at the transition. Obviously, in such situations one may have to consider a larger set of 
observables and/or richer parametrization of the model. We refer the reader to our theorems for 
the precise interpretation of the phrase "forbidden gap" in a general context. 

The main idea of the proof is that all Gibbs states (at the same temperature) have the same 
large-deviation properties on the scale that is exponential in volume. This permits us to compare 
any translation-invariant Gibbs state with a corresponding measure on the torus, where chess- 
board estimates can be used to rule out most of the undesirable scenarios. The comparison with 
torus boundary conditions requires a estimate on the interaction "across" the boundary; as usual 
this is implied by the absolute summability of interactions. This is the setting we assume for the 
bulk of this paper (cf Theorem 2.5). For systems with unbounded interactions, a similar con- 
clusion can be made under the assumption that the interactions are integrable with respect to the 
measures of interest (see Theorem 4.4). 

The rest of this paper is organized as follows: In Sect. 2.1 and 2.2 we define the class of mod- 
els to which our techniques apply and review various elementary facts about reflection positivity 
and chessboard estimates. The statements of our main theorems (Theorem 2.5 and Corollary 2.6) 
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come in Sect 2.3. The proofs constitute the bulk of Sect. 3; applications to recent results estab- 
lished by means of chessboard estimates are discussed in Sect. 4. The Appendix (Sect. 5) contains 
the proof of Theorem 4.4 which provides an explicit estimate on the energy gap from Theorem 3 
of [17]. This result is needed for one of our applications in Sect. 4. 



In order to formulate our principal claims we will first recall the standard setup for proofs of 
first-order phase transitions by chessboard estimates and introduce the necessary notations. The 
actual theorems are stated in Sect. 2.3. 

2.1 Models of interest. 

We will work with the standard class of spin systems on Z d and so we will keep our discussion 
of general concepts at the minimum possible. We refer the reader to Georgii's monograph [21] 
for a more comprehensive treatment and relevant references. 

Our spins, s x , will take values in a compact separable metric space Qq. We equip Qq with 
the cr-algebra ^ of its Borel subsets and consider an a priori probability measure vq on (Oo, &o). 
Spin configurations on Z d are the collections (s x ) xeZ d. We will use ft = S7q to denote the set of 
all spin configurations on Z d and & to denote the cr-algebra of Borel subsets of Q defined using 
the product topology. If A C Ij d , we define J^a to be the sub-u-algebra of events depending 
only on (s x ) xe A. For each x 6 Z d , the map t x : VL — ► Q. is the "translation by x" defined 
by (r x s) y = s x+y . It is easy to check that t x is a continuous and hence measurable for all x £ Z d . 
We will write A <<= Z d to indicate that A is a finite subset of Z d . 

To define Gibbs measures, we will consider a family of Hamiltonians (H\) AmZ d. These will be 
defined in terms of interaction potentials (&A)A<&. d - Namely, for each A <e Z d , let &a '■ ^ — ► R 
be a function with the following properties: 

(1) The function &a is ^4 -measurable for each A d Z d . 

(2) The interaction (&a) is translation invariant, i.e., <&a+x = &A t x for all x G Z d and 
all A <s Z d . 

(3) The interaction (<3?a) is absolutely summable in the sense that 



2. Main result 




(2.1) 



A<gL d 
oeA 



The Hamiltonian on a set A g Z' 



is a function H\ : f2 — > R defined by 




(2.2) 



A<gL d 
AnA^0 



For each f3 > 0, let <&p be the set of Gibbs measures for the Hamiltonian (2.2). Specifically, 
jj, G <5p if and only if the conditional probability /x( • |^a c ) — which exists since Q is a Polish 
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space — satisfies, for all AiZ d and ^-almost all s, the (conditional) DLR equation 



f3H A (s) 



[J u {ds x ). 



A*(da A |^Ae)(s) = 



(2.3) 



Here Z\ = Za(/3, sa c ) is a normalization constant which is independent of s\ = (s x ) x( =a- 

Remark 2. 1 The results of the present paper can be generalized even to the situations with un- 
bounded spins and interactions; see Theorem 4.5. However, the general theory of Gibbs measures 
with unbounded spins features some unpleasant technicalities that would obscure the presenta- 
tion. We prefer to avoid them and to formulate the bulk of the paper for systems with compact 
spins. Our restriction to translation-invariant interactions in (2) above is mostly for convenience 
of exposition. Actually, the proofs in Sect. 3 can readily be modified to include periodic interac- 
tions as well. 

2.2 Chessboard estimates. 

As alluded to before, chessboard estimates are among the principal tools for proving phase co- 
existence. In order to make this tool available, we have to place our spin system on a torus. 
Let Tl be the torus of L x • • • x L sites and let Hp : Q L — ► R be the function defined as fol- 
lows. Given a configuration s = (s x ) x< =j L , we extend s periodically to a configuration s on all 
of Z d . Using Hf L to denote the Hamiltonian associated with the embedding of Tx, into 7L d , we 
define Hp{s) = Hj L (s). The torus measure Fp^ then simply is 



Here Zp = Zp {(3) is the torus partition function. 

Chessboard estimates will be implied by the condition of reflection positivity. While this con- 
dition can already be defined in terms of interactions ($A)Acz d > ^ * s often easier to check it 
directly on the torus. Let us consider a torus with even L and let us split it into two sym- 
metric halves, and T^, sharing a "plane of sites" on their boundary. We will refer to the 
set P = T| n T L as a plane of reflection. Let ^p and &p denote the cr-algebras of events 
depending only on configurations in and , respectively. 

We assume that the naturally-defined (spatial) reflection d p : <-> gives rise to a map 
Op : J7q L — > S7q L which obeys the following constraints: 

(1) Op is an involution, Op o Op = id. 

(2) Op is a reflection in the sense that if A € ^p depends only on configurations in A C Tj, 
then Op {A) € ^p depends only on configurations in $p(A). 

In many cases of interest, Op is simply the mapping that is directly induced by the spatial re- 
flection dp, i.e., Op = i?p, where (#p(s)) = s^ p ( x y, our definition permits us to combine the 
spatial reflection with an involution of the single-spin space. 

Reflection positivity is now defined as follows: 



[ [ i/ (ds x ). 

xdT L 



(2.4) 



Zl 
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Definition 2.2 Let P be a probability measure on Q Q L and let E be the corresponding expecta- 
tion. We say that P is reflection positive, if for any plane of reflection P and any two bounded 
&p -measurable random variables X and Y, 

E(X9 P (Yj) = R(Y6 P (X)) (2.5) 

and 

E(X0 P (X)) > 0. (2.6) 
Here, 9p(X) denotes the -measurable random variable X o 9 P . 

Remark 2.3 Here are some standard examples of summable two-body interactions that are re- 
flection positive. Consider spin systems with vector- valued spins s x and interaction potentials 

^{x,y} = Jx,y {S x , Sy), X / y, (2.7) 

where J x<y are coupling constants and (-, •) denotes a positive-semidefmite inner product on Q. 
Then the corresponding torus Gibbs measure with (3 > is reflection positive (for reflections 
through sites) for the following choices of J^y's: 

(1) "Cube" interactions: Reflection-symmetric J^y's such that J X:V = unless x and y are 
vertices of a cube of 2 x • • • x 2 sites in Z d . 

(2) Yukawa-type potentials: 

J x ,y = e-^ x -y\\ (2.8) 

where \i > and \x — y\i is the t x -distance between x and y. 

(3) Power-law decaying interactions: 

1 

\x - y\f ' 

with x > 0. 



Jx,y — i . i„i (2-9) 



The proofs of these are based on the general theory developed in [20, 18, 19]; relevant calculations 
can also be found in [2, Sect. 4.2]. Of course, any linear combination of the above — as well as 
other reflection-positive interactions — with positive coefficients is still reflection positive. 

Now, we are finally getting to the setup underlying chessboard estimates. Suppose that L is 
an integer multiple of an (integer) number B. (To rule out various technical complications with 
the following theorem, we will actually always assume that L /p is a power of 2.) Let C 
be the box of (B + 1) x ■ ■ ■ x (B + 1) sites with the "lower-left" corner at the origin — we will 
call such box a B -block. We can tile by translates of by S-multiples of vectors from the 
factor torus, T = T L / B . Note that the neighboring translates of A# will have a side in common. 
Let A be an event depending only on configurations in A^; we will call such A a B-block event. 
For each t G T, we define the event 9t{A) as follows: 

(1) If t has all components even, then 9 t (A) is simply the translation of A by vector Bt, i.e., 
9 t (A) = t b I(A) = {se nl L : T Bt (s) g A}. 

(2) For the remaining t G T, we first reflect A through the "midplane" of A^ in all directions 
whose component of t is odd, and then translate the result by Bt as before. 
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Thus, 6 t (A) will always depend only on configurations in the -block A# + Bt. 
The desired consequence of reflection positivity is now stated as follows. 

T 

Theorem 2.4 (Chessboard estimate) Let Fbea measure on Q L which is reflection-positive with 



respect to 9p. Then for any B-block events Ai, ■ ■ ■ , A m and any distinct sites t\, . . . , t m G T, 

p( n dtMi)) < n p ( n e *w) /m - ^ 

Proof. See [20, Theorem 2.2]. □ 



The moral of this result — whose proof is nothing more than an enhanced version of the Cauchy- 
Schwarz inequality applied to the inner product X, Y <— > E(X6p(Y)) — is that the probability of 
any number of events factorizes, as a bound, into the product of probabilities. This is particularly 
useful for contour estimates; of course, provided that the word contour refers to a collection of 
boxes on each of which some "bad" event occurs. Indeed, by (2.10) the probability of a contour 
will automatically be suppressed exponentially in the number of constituting "bad" boxes. 

2.3 Main theorems. 

For any .B-block event A, we introduce the quantity 

= L i^ o ( p ^( n^))) /|TI > ( 2 - n) 

teT 

with the limit taken over multiples of B. The limit exists by standard subadditivity arguments. 
While the definition would suggest that pp(A) is a large-deviation rate, chessboard estimates 
(2.10) show that pp(A) can also be thought of as the "probability of A regardless of the status 
of all other -blocks." This interpretation is supported by the fact that A i-> pp{A) is an outer 
measure on with pp(£l) = 1, cf. Lemma 6.3 of [5]. 

Furthermore, recalling that A^-i is the block of N x • • • x N sites with the "lower-left" corner 
at the lattice origin, let 

Rn(A) = --^— V \ A or Bx (2.12) 

be the fraction of S-blocks (in Ajvb-i) in which A occurs. Whenever p, G <8p is a Gibbs 
state for the Hamiltonian (2.2) at inverse temperature [5 that is invariant with respect to the 
shifts (TBx)xei, d ' tne 1*™* 

Pfl (A) = lim R N (A) (2.13) 

N— >oo 

exists ^-almost surely. In the following, we will use Pll (A) mostly for measures that are actually 
ergodic with respect to the shifts by multiples of B. In such cases the limit is self-averaging, 
Pij,(A) = p(A) almost surely. Notwithstanding, we will stick to the notation p^{A) to indicate 
that claims are being made about almost-sure properties of configurations and not just expecta- 
tions. To keep our statements concise, we will refer to measures which are invariant and ergodic 
with respect to the translations (TBx) x ei* d as B -shift ergodic. 
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Our principal result can be formulated as follows: 

Theorem 2.5 Let d > 2 and consider a spin system as described above for which that the torus 
measure is reflection positive for all (5 > and all even L > 2. Let Gi, . . . ,Q r be a finite number 
of B -block events and let B = (Gi U • • • U Q r ) c - Suppose that the good block events are mutually 
exclusive and non- compatible (different types of goodness cannot occur in neighboring blocks): 

(1) GiHGj = foralli^j. 

(2) Ifti,t2 G T are nearest neighbors, then 

tl {Gi) n 9 t2 (Gj) = far all i ± j. (2. 14) 

Then for every e > 0, there exists 5 > — which may depend on d but not on the details of the 
model nor on B or n — such that for any (5 > with pp(B) < 5 we have 

Pli (B)€[0,e] (2.15) 

and 

Pn{Gi) € [0, e] U [1 — e, 1], i = l,...,r, (2.16) 

for every B -shift ergodic Gibbs state p € In particular, if e < V2 then for every such \x there 
exists a unique i such that p^Gi) > 1 — e and p^iGj) < efor all j 7^ i. 

We remark that the conclusion of Theorem 2.5 holds even when the requirement of compact 
single-spin space and norm-bounded interactions are relaxed to the condition of finite average en- 
ergy. We state the corresponding generalization in Theorem 4.5. Theorem 2.5 directly implies the 
standard conclusion of chessboard estimates (cf. [14, Propositions 3.1-3.3] or [25, Theorem 4]): 

Corollary 2.6 Let d > 2, let 0\ < $2 he two inverse temperatures and let Gi and Gi be two mu- 
tually exclusive, non-compatible good B -block events (cf conditions (1) and (2) in Theorem 2.5). 
Then, for every e > there exists a constant 5 > — which may depend on d but not B or the 
details of the model — such that the conditions 

(1) p p (B) < 5 for all (3 G [ft, ft] and 

(2) ^(£2) < 6andpp 2 (Gi) < 5 

imply an existence of an inverse temperature ft G (ft, ft) and of two distinct B-shift ergodic 
Gibbs measures pi, P2 £ (5/3, such that 

P H (Gj)>l-e, i = 1,2. (2.17) 

The above assumptions (1) and (2) appear in some form in all existing proofs based on chess- 
board estimates; see Sect. 4 for some explicit examples. The conclusions about the set of coex- 
istence points can be significantly strengthened when, on the basis of thermodynamic arguments 
and/or stochastic domination, the expected amount of goodness G2 increases (and Gi decreases) 
with increasing ft For e <C 1 the phase diagram then features a unique (massive) jump at some ft 
from states dominated by Gi to those dominated by G2- Theorem 2.5 implies that the bulk of 
the values inside the jump are not found in any ergodic Gibbs state. Both Theorem 2.5 and 
Corollary 2.6 are proved in Sect. 3.2. 
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Remark 2.7 Both results above single out inverse temperature as the principal parameter of inter- 
est. However, this is only a matter of convenience; all results hold equally well for any parameter 
of the model. An inspection of the proof shows that we can take 5 = c(d)e 2 / d in Theorem 2.5, 
where c(d) is a constant that grows with dimension. However, the dependence on e should be 
significantly better; we made no attempts to reach the optimum. In any case, the fact that 5 does 
not depend on the details of the model is definitely sufficient to prove phase coexistence. 



3. Proofs of main results 

We will assume that there is an ergodic Gibbs measure fi G (3/3 that violates one of the condi- 
tions (2.15-2.16), and derive a contradiction. Various steps of the proof will be encapsulated in 
technical lemmas in Sect. 3.1; the actual proofs come in Sect. 3.2. 

3.1 Technical lemmas. 

Our first step is to convert the information about infinite-volume densities into a finite volume 
event. Using the sites from Ajy_i to translate the B -block A# by multiples of B in each coor- 
dinate direction, we get UzeAjv 1 (^b + Bx) = A/v^. Similarly, considering translates of Anb 
by vectors NBx where x G A^-i, we get UzeA M _! (^nb + NBx) = Amnb ■ The important 
point is that, while the neighboring translates Anb + NBx and Ajvs + NBy are not disjoint, 
they have only one of their (d — 1) -dimensional sides in common. 
Let Bn and £j : N, j = 1, ■ ■ ■ , r, be events defined by 

B N = {R N (B) > e} (3.1) 

and 

£ h N = {R N (Gj) >e}, j = 1, . . . ,r. (3.2) 

Introducing the event 

£ N = B N U |J (£i, N n£ j:N ) (3.3) 

l<i<j<r 

and the fraction Rm,n(£n) of fW-blocks (in Amnb) in which Sn occurs, 

Rm,n(£n) = 7T f l£ N °T N Bx, (3-4) 

x&Am-i 

we have: 

Lemma 3.1 Let e < V2 ond consider a B-shift ergodic Gibbs measure [i G <£>g that violates 
one of the conditions (2.15-2.16). Then there exists an No < 00 and, for each N > Nq, there 
exists an Mo = Mo(N) such that for all N > No and all M > Mo(N), one has 

h{Rm,n(£n)>V2) >^d. (3.5) 
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Proof. The proof is based on a two-fold application of the Pointwise Ergodic Theorem. Indeed, 
by ergodicity of p and Fatou's lemma we know that 

liminf A i(B JV )>/x( / 9 (U (H)>e) (3.6) 

and 

liminf p{£ i>N n Sj, N ) > n{ {p^{Qi) > e} n {p^Gj) > e}). (3.7) 

But /x violates one of the conditions (2.15-2.16) and so either p^(B) > e or p^Qi) > e 
and Pfi(Gj) > e for some i / j. All of these inequalities are valid /x-almost surely and so it 
follows that 

p(£ N ) -— 1. (3.8) 

A<^oo 

Now, let us fix A so that p(£n) > 3 /4- Then ergodicity with respect to translates by multiples 
of B implies that 

m( (J (#M,iv(£7v) o T By > 1/2}) > m( ^ X] R m,n(£n) o T B y > ^) 

?/GA iV _ 1 j/eAjv_i (3.9) 

= h(R M n(£n) > V2) — ► 1. 

It follows that the left-hand side exceeds x li once M is sufficiently large, which in conjunction 
with subadditivity and r^-invariance of p directly implies (3.5). □ 

Our next task will be to express £n solely in terms of conditions on bad B-blocks in A^b = 
{J xe A N _ l (A-B + Bx). Given two distinct sites x,y G Ajy_i, let {x ^ y} denote the event that 
there is no nearest-neighbor path it = (x\, . . . , Xk) on Ajv-i such that 

(1) 7r connects x to y, i.e., x\ = x and Xk = y. 

(2) all B-blocks "along" it are good, i.e., tb Xj (£>°) occurs for all j = 1, . . . , k. 

Note that {x ^ y} automatically holds when one of the blocks Ab + Bx or Ab + By is bad. 
Further, let Y/v be the (<^a nb -measurable) random variable 

Yn = #{(x,y) e Aat_i x Ajv-i: x + ykx ^ y) (3.10) 

and let Cat be the event 

C N = {Y N > (eN d ) 2 }. (3.11) 
Conditions (1) and (2) from Theorem 2.5 now directly imply: 

Lemma 3.2 For all N, we have £n C Cat. 

Proof. Clearly, we have Bn C Cat, and so we only have to show that 

£i,N n £ jVV C C N , l<i<j<r. (3.12) 

Let us fix i / j and recall that on £ ^at n S^an at least an e-fraction of all B-blocks in Anb 
will be i-good and at least an e-fraction of them will be j-good. By conditions (1) and (2) from 
Theorem 2.5, no two i?-blocks of different type of goodness can be connected by a path of 
good U-blocks, and so there are at least (eN d ) 2 pairs of distinct S-blocks in Anb that are not 
connected to each other by a path of good blocks. This is exactly what defines the event Cat. □ 
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The events 8^ and Cjy have the natural interpretation as NB -block events on whenever L is 
divisible by NB. If A is such an NB -block event, let pp(A) denote the analogue of the quantity 
from (2.11) where the 6>t's now involve translations by multiples of NB. Our next technical 
lemma provides an estimate on p^Cjv) in terms of pp(B): 

Lemma 3.3 Let d be the dimension of the underlying lattice and suppose that d > 2. For 
each e > — underlying the definitions of Bn, £n and Cn — and each rj > 0, there exists a 
number 5 = 5(e, r/, d) > such that ifpp(B) < 5, then P@(Cn) < V- 

Proof. Let us use Ul^(Cn) to abbreviate the quantity 

n Ltf) (C N ) = P Li/3 ( P| t (C N j), (3.13) 

teT 

where T = T L u NB \ is the factor torus in the present context. Observing that Cn is preserved 
by reflections through the "midplanes" of A.nb, a multivariate version of Chebyshev's inequality 
then yields 

w^fn^). 0.14, 

teT 

Here ^l,/3 is the expectation with respect to ^l,/3- 

To estimate the right-hand side of (3.14), we will rewrite Yjv as a sum. Let x,y G Ajy_i 
be distinct. A connected subset V C Ajv-i is said to separate x from y (in An_i) if each 
nearest-neighbor path ir from x to y on Ajv-i intersects V. We use &(x, y) to denote the set 
of all such sets Y C Ajv-i. Notice that {x},{y} G &(x,y). We claim that, whenever (x,y) 
is a pair of points contributing to Yjv, there exists V G &(x,y) separating x from y such that 
every block A# + Bz with z G T is bad. Indeed, if A^ + Bx is a bad block we take V = {x}. 
If Ab + Bx is a good block, then we define ^ x to be the maximal connected subset of Ajv_i 
containing x such that A# + Bz is a good block for all z G and let T be its external boundary. 
Using 1 r to denote the indicator of the event that every block A# + Bz with z G T is bad, we get 

E E 1r - ^ 3 - 15 ^ 

^,y^^N-i ree(x,y) 

Let K = (^jf) d be the volume of the factor torus and let ti, . . . , tx be an ordering of all sites 
of T. Then we have 

^lAC^^jJ^ J2 E ^Jll^oTBmX (3.16) 

V ^ {x jtVj ) Ti,...,T K V i=l 7 

j=l,...,K 

where the first sum runs over collections of pairs (xj,yj), j = 1,...,K, of distinct sites in An_i 
and the second sum is over all collections of separating surfaces Tj G &(xj,yj), j = 1, . . . , K. 

To estimate the right-hand side of (3. 16) we define p L,p{B) to be the quantity on the right-hand 
side of (2.11), before taking the limit L — > oo, with A = B. Since each indicator 1^ ° TBNtj 
enforces bad blocks A# + B{z + Ntj) for z G Tj, and the set of blocks A# + B(z + Ntj), 
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z G Ajv-i, is, for ij / tj, disjoint from the set A# + B(z + Nti), z G Ajv-i, we can use 
chessboard estimates (Theorem 2.4) to get 

VlJU^ ,otsJ < [PL^(B)] |ri|+ "" +|rjc| . (3-17) 

A standard contour-counting argument now shows that, for any distinct x, y G Ajv-i, 

2 [pL )/3 (H)] |r| < Cl p^(B) d (3.18) 

ree(x,jO 

with some constant Ci = ci(<f), provided that pz,,/3(#) is sufficiently small. The sum over collec- 
tions of pairs (xj, yj), j = 1, . . . , K, contains at most (N 2d ) K terms, allowing us to bound 

n „( Cw ) < (3,9) 

Since U Lj/3 (C n ) 1/k -> P/3( c a0 as L -> oo, it follows that pp(C N ) < cipp(B) d e~ 2 , which 
for p 13(B) small enough, can be made smaller than any r/ initially prescribed. □ 

Our final technical ingredient is an estimate on the Radon-Nikodym derivative of a Gibbs 
measure jjl G ftp and the torus measure at the same temperature: 

Lemma 3.4 Let Al C 7L d be an L-block and let T2L be a torus of side 2L. Let us view Al as 
embedded into T 2 l and let ^2L,f3 be the torus Gibbs measure on T 2 l. Then for any a > there 
exists Lq such that 

e-^ Ld F 2L ^(A) < fi(A) < ^ aLd F 2LtP (A). (3.20) 
for all L > Lq, any \x G <8p, and any & ^-measurable event A. 

Proof. For finite-range interactions, this lemma is completely standard. However, since our 
setting includes also interactions with infinite range, we provide a complete proof. We will prove 
only the right-hand side of the above inequality; the other side is completely analogous. 

First, from the DLR equation we know that there exists a configuration s = (s x ) x€li d, such 
that 

h(A\&a°)(s) > fi{A) (3.21) 
with the left-hand side of the form (2.3). Let s' be a configuration on T 2 l- We will show that 
M • |^A t L )(s) and P2L,/3( • \^A c L )(s') are absolutely continuous with respect to each other — as 
measures on <^a l — and the Radon-Nikodym derivative is bounded above by ^ aL,d regardless of 
the "boundary conditions" s and s'. 

Suppose that s' x = s x for all x G A^ and let s' be its 2L-periodic extension to all of Z d . 
Then the Radon-Nikodym derivative of F 2 l,/3( ■ |^a c ){s') with respect to the product measure 
U xeAL M^x) ise-^^VZA^c.) while that of /x( ■ |^ A cJ( S ) is W/Z Al ( SA cJ. It 

thus suffices to show, uniformly in (s x ) x( z Al , that 

\H AL (s)-H AL (s')\<^L d (3.22) 
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once L is sufficiently large. To this end, we first note that 

\H Al (s)-H Al (s')\<2 Moo- ^ 3 - 23 ) 

A: AnA L ^d 

AnA c L ^9 

To estimate the right-hand side, we will decompose Al into "shells," A n \ A n _i, and use the fact 
that if A intersects A n \ A n _i as well as A C L , then the diameter of A must be at least L — n. Using 
the translation invariance of the interactions, we thus get 

L 

Y Halloo < ^|A„\A n _i| ll^lloo- (3-24) 

A: AnA L ^0 n=l A: OS A 

AnA c L ^0 diam(A)>L-n 

But |||<I>||| < oo implies that the second sum tends to zero as L — n — > oo and since |A n \ A„_i | = 
o(L d ) while 2^2i< n <L \ A n \ A„_i| = L d , the result is thus o(L d ). In particular, for L sufficiently 
large, the right-hand side of (3.23) will be less than |L d . □ 

3.2 Proofs of Theorem 2.5 and Corollary 2.6. 

Now we are ready to prove our main theorem: 

Proof of Theorem 2.5. Fix e < V2 and let 6 ftp be a 5-shift ergodic Gibbs measure for which 
one of the conditions (2.15-2.16) fails. Applying Lemma 3.1 and the inclusion in Lemma 3.2 we 
find that 

v{Rm,n(Cn) > V 2 ) > (3.25) 

once N > N and M > M (N). Now, consider the torus T L of side L = 2MNB and em- 
bed Amnb = UrreA M -i + NBx) into T/, in the "usual" way. By Lemma 3.4 we know that 
for my fixed N > Nq, there exists a sequence om of positive numbers with clm I as M — > 00, 
such that we have 

PL^(i?M,Ar(Civ) > V2) > ^e-«^) da - Md , M - 00. (3.26) 

Our goal is to show that, once N is chosen sufficiently large, the left-hand side is exponentially 
small in M d , thus arriving at a contradiction. 

By conditioning on which of the M d /2 translates of A^tv have Cn satisfied, and applying the 
chessboard estimates in blocks of side NB, we get 

VlA r m,n(C n ) > V2) < 2 Md p 2L ACN) Md/2 , (3.27) 

where p2L,/3(C/v) is the finite-torus version of p^Cjv). Next we choose rj < V4 and let 5 > 
and N > Nq be such that the bounds in Lemma 3.3 apply. Then for all sufficiently large M (and 
hence all large L) we have P2L,p(Cn) < V and so 

WlA r m,n(Cn) > V2) < (4r?) Md / 2 . (3.28) 

But this is true for all M » 1 and so the bound (3.26) must be false. Hence, no such fi 6 &p 
could exist to begin with; i.e., (2.15-2.16) must hold for all i?-shift ergodic ji £ (6/3. □ 
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To finish our proofs, we will also need to establish our claims concerning phase coexistence: 

Proof of Corollary 2.6. Suppose that e and 5 are such that Theorem 2.5 applies. By condition (1), 
the conclusions (2.15-2.16) of this theorem are thus available for all (3 G [(3i, /%]. This implies 

Pfl (Gj) G [0,e] U [l-e,l], j = l,2, (3.29) 

for every S-shift ergodic p, G <8p at every (3 G [/3i, /?2]- We claim that p^-i) is small in every 
ergodic state p G 0^. Indeed, by Lemma 6.3 of [5] and condition (2) of the corollary, we have 

Ph {B U g 2 ) < p Pl (B) + P/3l (Gj) < 25. (3.30) 

Hence, if the 5 in Corollary 2.6 was so small that Theorem 2.5 applies for some e < V2 even 
when 5 is replaced by 25, we can regard B U G2 as a bad event dl (3 = (3\ and conclude 
that p^G'i) < V2, and hence p fl (G2) < e, by (3.29), in every ergodic /i G 0^ . A similar 
argument proves that p fi (Gi) < e in every ergodic p G 0/3 2 . Usual weak-limit arguments then 
yield the existence of at least one point /3 t G {(3\, /%) where both types of goodness coexist. □ 



4. Applications 

The formulation of our main result is somewhat abstract. In the present section, we will pick 
several models in which phase coexistence has been proved using chessboard estimates and use 
them to demonstrate the consequences of our main theorem. Although we will try to stay rather 
brief, we will show that, generally, the hypothesis of our main result — i.e., the assumption on 
smallness of the parameter pp(B) — is directly implied by the calculations already carried out in 
the corresponding papers. The reader should consult the original articles for more motivation and 
further details concerning the particular models. 

4.1 Potts model. 

The g-state Potts model serves as a paradigm of order-disorder transitions. The existence of the 
transition has been proved by chessboard estimates in [25]. While the completeness of the phase 
diagram has, in the meantime, been established with the help of Pirogov-Sinai theory [28], we 
find it useful to illustrate our general claims on this rather straightforward example. Later on we 
will pass to more complex systems where no form of completeness — and, more relevantly, no 
"forbidden gap" — has been proved. 

The spins a x of the g-state Potts model take values in the set { 1 , ... , q} with a priori equal 
probabilities. The formal Hamiltonian is 

H(a) = -Y,S W (4-D 

where (x, y) runs over all (unordered) nearest-neighbor pairs in Z d . The states of minimal energy 
have all neighboring spins equal, and so we expect that low temperature states are dominated 
by nearly constant spin-configurations. On the other hand, at high temperatures the spins should 
be nearly independent and, in particular, neighboring spins will typically be different from each 
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other. This leads us to consider the following good events on 1-block 

= [a: a x / a y for all x, y G Ai, \x - y\ = l}, 

(4.2) 

gord,m = | ,j : ^ = m for all x G Ai } , m = 

Using similar events, it was proved [25] that, for d > 2 and q sufficiently large, there exists an 
inverse temperature (3 t and q + 1 ergodic Gibbs states /i dls G 0,3, and /U ord ' m G (3$, m = 1, . . . , (/, 
such that the corresponding 1-block densities satisfy 

/y ls (S dis ) > l-e (4.3) 

and 

p^m (g ord ' m ) > 1 - e, m = 1, . . . , q, (4.4) 

where e = e(q) tends to zero as q — > oo. In addition, monotonicity of the energy density as a 
function of /3 can be invoked to show that p^(G dls ) is large in all translation-invariant p G (3/3 
when /3 < /3 t , while it is small in all such states when j3 > (3 t . 

The full completeness [28] asserts that the above-mentioned q + 1 states exhaust the set of all 
shift-ergodic Gibbs states in (25$. A weaker claim follows as a straightforward application of our 
Theorem 2.5: For each shift-ergodic Gibbs state p, G (5$ f/iere is either p fl (Q dls ) > 1 — e or 
Pfi{Q ord,m ) > 1 — efor some m = 1, . . . , q. 

The main hypothesis of our theorem amounts to the smallness of the quantity ppiB), where 

q 

B= (g dis U (J £° rd > m ) C , (4.5) 

m=l 

which in turn boils down to an estimate on the probability of the disseminated event B on the right- 
hand side of (2.11). The needed estimate coincides with the bound provided in [25] by evaluating 
directly (i.e., "by hand") the energy and the number of contributing configurations. The result — 
which in [25] appears right before the last formula on p. 506 is used to produce (4.4') — reads 



This implies the needed bound once g> 1. 



^d-2-( d - 1 ) n J_ 

2d 



L (q - 2d) c 



(4.6) 



Remark 4. 1 Analogous calculations establish the corresponding forbidden gap in more compli- 
cated variants of the Potts model; see e.g. [4]. 



4.2 Intermediate phases in dilute spin systems. 

The first instance where our results provide some new insight are dilute annealed ferromagnets 
exhibiting staggered order phases at intermediate temperatures. These systems have been studied 
in the context of both discrete [10] and continuous spins [11]. The characteristic examples of 
these classes are the site-diluted Potts model with the Hamiltonian 

H(n, a) = - ^2 n x ny(S ax!ay -l)-A^n x -re^ n x n y (4.7) 
(x,y) x (x,y) 
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and the site-diluted XY -model with the Hamiltonian 

H(n, cf)) = - n x n v [cos(0 x - (j) y ) - l] - A ^ - k ^ n^. (4.8) 

(x,y) x (x,y) 

Here, as before, o x G {1, ...,<?} are the Potts spins, 4> x G [— 7r, 7r) are variables representing the 
"angle" of the corresponding 0(2)-spins, and n x G {0, 1} indicates the presence or absence of a 
particle (that carries the Potts spin a x or the angle variable <j) x ) at site x. 

On the basis of "usual" arguments, the high temperature region is characterized by disordered 
configurations while the low temperatures features configurations with a strong (local) order, at 
least at small-to-intermediate dilutions. The phenomenon discovered in [10, 11] is the existence 
of a region of intermediate temperatures and chemical potentials, sandwiched between the low 
temperature/high density ordered region and the high temperature/low density disordered region, 
where typical configurations exhibit preferential occupation of one of the even/odd sublattices. 
The appearance of such states is due to an effective entropic repulsion. Indeed, at low tempera- 
tures the spins on particles at neighboring sites are forced to be (nearly) aligned while if a particle 
is completely isolated, its spin is permitted to enjoy the full freedom of the available spin space. 
Hence, at intermediate temperatures and moderate dilutions, there is an entropic advantage for 
the particles to occupy only one of the sublattices. 

Let us concentrate on the portion of the phase boundary between the staggered region and the 
low temperature region. The claim can be stated uniformly for both systems in (4.7-4.8) provided 
we introduce the relevant good events in terms of occupation variable n. Namely, we let: 

gdense = |^ ^ . ^ = j for ^ % £ ^ 

= { ( a , n) : n x = 1 {x even} for all x G A x } , (4.9) 

G odd = { (<r, n) : n x = 1 {x odd} for all x G Ai } . 

Again, using slightly modified versions of these events, it was shown in [10, 11] that there exist 
positive numbers e, kq <C 1 and, for every k G an interval I(k) C R such that the 

following is true: For any A £ I there exist inverse temperatures (3\(k, A) and 02 (k, A), and a 
transition temperature (3 t (n, A) G [Pi, fo] such that 

(1) for any (3 G [f3 t , fa] there exists an "densely occupied" state ^ dense g <5p, for which 

V e„ S e(g denS£ ) > 1-e, (4.10) 

(2) for any G A] there exist two states ^, even , fi odd G <5p satisfying 

^even (a 6 ™) > 1 - € md p^odd > 1 - €. (4.11) 

The error e is of order (3~ 1/s (cf. the bound (2.15) in [11]) in the case of the XY-model in d = 2, 
and it tends zero as q —> oo in the case of the diluted Potts model. 

A somewhat stronger conclusion can be made for the diluted Potts model. Namely, at (3 = p t , 
there are actually q + 2 distinct states, two staggered states p even and /i odd and q ordered states 
/x dense ' m , with the latter characterized by the condition 

/9 dense,™ (g denS£ ' m ) > 1 - €, (4.12) 
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where 

£dense,m = { ^ ^ . ^ = j and ^ = m for all X G Ai}. (4. 13) 

It is plausible that an analogous conclusion applies to the XY-model in d > 3 because there the 
low-temperature phase should exhibit magnetic order. However, in d = 2 such long-range order 
is not permitted by the Mermin-Wagner theorem and so there one expects to have only 3 distinct 
ergodic Gibbs states at /3 t . 

A weaker form of the expected conclusion is an easy consequence of our Theorem 2.5: For 
each extremal 2-periodic Gibbs state p, G <8# there exists Q G {£ even , Q odd , £ dense } (in the case 
of diluted Potts model, Q G {£ even , £ odd , g dense > m , m = 1, . . . ,q}) such that 

P„(G)>l-e. (4.14) 

In particular, no ergodic Gibbs state p G has particle density in [e, V2 — e] U [V2 + e, 1 — e]. 
The proof of these observations goes by noting that the smallness of pp(B) for the bad event 
B = (^ dense u Q even u g odd y is a direct consequence of the corresponding bounds from [10, 11] 
of the "contour events." In the case of the XY-model in dimension d = 2, this amounts to the 
bounds (2.9) and (2.15) from [11]. 

Remark 4.2 A more general class of models, with spin taking values in a Riemannian manifold, 
is also considered in [11]. A related phase transition in an annealed diluted 0(n) Heisenberg 
ferromagnet has been proved in [12]. 



4.3 Order-by-disorder transitions. 

Another class of systems where our results provide new information are the 0(2)-nearest and 
next-nearest neighbor antiferromagnet [3], the 120-degree model [5], and the orbital-compass 
model [6]. All of these are continuum-spin systems whose common feature is that the infinite 
degeneracy of the ground states is broken, at positive temperatures, by long-wavelength (spin- 
wave) excitation. We will restrict our attention to the first of these models, the 0(2)-nearest and 
next-nearest neighbor antiferromagnet. The other two models are somewhat more complicated — 
particularly, due to the presence of non-translation invariant ground states — but the conclusions 
are fairly analogous. 

Consider a spin system on Z 2 whose spins, S x , take values on the unit circle in R 2 with a 
priori uniform distribution. The Hamiltonian is 

H(S) = '^2(S X • Sx+e!+e 2 + Sx " <SWei-e 2 ) + 7^^('S'a: ' <S^+ei + S x ■ S x+ s 2 ), (4.15) 

x x 

where Si and £2 are the unit vectors in the coordinate lattice directions and the dot denotes the 
usual scalar product. Note that both nearest and next-nearest neighbors are coupled antiferro- 
magnetically but with a different strength. The following are the ground state configurations 
for 7 G (—2,2): Both even and odd sublattices enjoy a Neel (antiferromagnetic) order, but the 
relative orientation of these sublattice states is arbitrary. 

It is clear that, at low temperatures, the configurations will be locally near one of the afore- 
mentioned ground states. Due to the continuous nature of the spins, the fluctuation spectrum 
is dominated by "harmonic perturbations," a.ka. spin waves. A heuristic spin-wave calculation 
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(cf. [5, Sect. 2.2] for an example in the context of the 120-degree model) suggests that among 
all 2ir possible relative orientations of the sublattices, the parallel and the antiparallel orienta- 
tions are those entropically most favorable. And, indeed, as was proved in [3], there exist two 
2-periodic Gibbs states \i\ and p2 with the corresponding type of long-range order. However, the 
existence of Gibbs states with other relative orientations has not been ruled out. 

We will now state a stronger version of [3, Theorem 2.1]. Let B be a large even integer and 
consider two S-block events Q\ and Q2 defined as follows: fixing a positive re -C 1, let 

p| {S x -S y >l-re}n fl {S x ■ S x+&2 < -1 + re}, (4.16) 

(y-x)-e 2 =0 

i.e., Qi enforces horizontal stripes all over A#. The event Q2 in turn enforces vertical stripes; the 
definition is as above with the roles of ei and 82 interchanged. Then we have: 

Theorem 4.3 Let 7 G (0, 2) and let k <C 1. For each e > there exists (3q 6 (0, 00) such that 
for each (3 > (3$: 

(1) There exist two ergodic Gibbs states fx±, P2 £ 0/?, such that 

P H (Sj)>l-e, J = 1,2. (4.17) 

(2) There exists an integer B > 1 such that for any \i G 0/3 that is ergodic with respect to shifts 
by multiples of B we have 

either p^G x ) > 1 - e or Pil (Q 2 ) > 1 - e. (4.18) 

The first conclusion — the existence of Gibbs states with parallel and antiparallel relative orien- 
tation of the sublattices — was the main content of Theorem 2.1 of [3]. What we have added here is 
that the corresponding configurations dominate all ergodic Gibbs states. The 0(2) ground-state 
symmetry of the relative orientation of the sublattices is thus truly broken at positive tempera- 
tures, which bolsters significantly the main point of [3]. Note that no restrictions are posed on 
the overall orientation of the spins. Indeed, by the Mermin- Wagner theorem every p £ <5p is 
invariant under simultaneous rotations of all spins. 

Proof of Theorem 4.3. As expected, the proof boils down to showing that, for a proper choice 
of scale B we have pp{B) <C 1 for B = (Qi U £2)°- In [3] this is done by decomposing B into 
more elementary events — depending on whether the "badness" comes from excessive energy or 
insufficient entropy — and estimating each of them separately. The relevant bounds are proved in 
[3, Lemmas 4.4 and 4.5] and combined together in [3, Eq. (4.20)]. Applying Theorem 2.5 of the 
present paper, we thus know that every 5-shift ergodic \i 6 0^ is dominated either by blocks 
of type Qi or by blocks of type Q2. Since p^(B) < e in all states, the existence of pi,p2 £ 0/3 
satisfying (4.17) follows by symmetry with respect to rotation (of the lattice) by 90-degrees. □ 

4.4 Nonlinear vector models. 

A class of models with continuous symmetry that are conceptually close to the Potts model has 
been studied recently by van Enter and Shlosman [17]. As for our previous examples with con- 
tinuous spins, Pirogov-Sinai theory is not readily available and one has to rely on chessboard 
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estimates. We will focus our attention on one example in this class, a nonlinear ferromagnet, 
although our conclusions apply with appropriate, and somewhat delicate, modifications also to 
liquid crystal models and lattice gauge models discussed in [17]. 

Let us consider an 0(2)-spin system on 1? with spins parametrized by the angular variables 
<p x G (— 7T, 7r]. The Hamiltonian is given by 

^) = -E( 1 + c °t a ~^ ) P > (4 - 19) 

where p is a nonlinearity parameter. The a priori distribution of the <\> x 's is the Lebesgue measure 
on (— 7r, 7r]; the difference <p x — <p y is always taken modulo 2ir. 

In order to define the good block events, we first split all bonds into three classes. Namely, 
given a configuration {(f>x) x ^i?, we sav tnat the bond (x, y) is 

(1) strongly ordered if \<j> x — <j> y \ < jj^f, 

(2) weakly ordered if ^= < \<f> x — <f) y \ < -^=, and 

(3) disordered if \cj) x — (j> y \ > 

Here C is a large number to be determined later. If a bond is either strongly or weakly ordered, 
we will call it simply ordered. 

On the basis of (4.19), it is clear that strongly ordered bonds are favored energetically while 
the disordered bonds are favored entropically. The main observation of [17] — going back to [14, 
25, 1] — is that, at least in torus measures, ordered and disordered bonds are unlikely to occur 
in the same configuration. This immediately implies coexistence of at least two distinct states 
at some intermediate temperature. Moreover, since it is also unlikely to have many bonds in 
the "borderline" region \(f> x — <f> y \ -^=, the transition is accompanied by a jump in the energy 
density. But, to prove that the energy gap stays uniformly positive as p — > oo, it appears that 
one needs to establish the existence of a free-energy barrier between the strongly ordered and 
disordered phases. 

Let Ai be a 1-block (i.e., a plaquette) and let us consider the following good events on Ai: The 
event that all bonds on Ai are strongly ordered, 

Gso = [\<Px-4>y\ < Vx,y G Ai, \x - y\ = l}, (4.20) 

and the event that all bonds on Ai are disordered, 

C_ 

Then we have: 



( C i 

Sdis = [\4>x -<t> y \ > Vx,y G A u \x - y\ = lj. (4.21) 



Theorem 4.4 For each e > and each sufficiently large C > 1, there exists po > such 
that for all p > po, there exists a number (3 t £ (0, oo) and two distinct, shift-ergodic Gibbs 
states |U S0 , n dls G (25$ such that 

/V°(£ so ) > 1 - e and p^G&s) > 1 - e. (4.22) 
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In addition, for all shift-ergodic Gibbs states p G &/3 t , we have 

either p^Gdk) > 1 - e or p M (£ so ) > 1 - e, (4.23) 

while 

PuiGso) > 1 — e for all shift-ergodic [i G 0/3 with (3 > fit (4.24) 

and 

Pn(Gdis) > 1 — e for all shift-ergodic p G <5p with (3 < {3 t . (4.25) 
Finally, for every p > po and C large, every ergodic Gibbs state will have energy near zero 
when (3 > (3 t and at least 1 — 0(C~ 2 ) when (3 < (3 t . 

We remark that the existence of a first-order transition in energy density has been a matter of 
some controversy in the physics literature; see [16, 17] for more discussion and relevant refer- 
ences. The proof of Theorem 4.4 is fairly technical and it is therefore deferred to Sect. 5. 

4.5 Magnetostriction transition. 

Our final example is the magnetostriction transition studied recently by Shlosman and Zagreb- 
nov [33]. The specific system considered in [33] has the Hamiltonian 

H(a, r) = -^2,J (r Xt y)a x a y + k ^ (r X)W - R) 2 + A ^ (r x>y - r z>y ) 2 . (4.26) 

{x,y) (x,y) {x,y),(z,y) 

\x-z\=V5 

Here the sites x 6 7L d label the atoms in a crystal; the atoms have magnetic moments represented 
by the Ising spins a x . The crystal is not rigid; the variables r x ^ y G R, r x ^ y > 0, play the role of 
spatial distance between neighboring crystal sites. 

The word magnetostriction refers to the phenomenon where a solid undergoes a magnetic 
transition accompanied by a drastic change in the crystalline structure. In [33] such a transition 
was proven for interaction potentials J = J(r x ^ y ) that are strong at short distances and weak at 
large distances. The relevant states are characterized by disjoint contracted, 

gcontr = {(r,a):r X:y < v , Vx, y G A 1 ,\x-y\ = l}, (4.27) 

and expanded, 

gex P ,± = ^ r a y r ^ > r? + 6) yx ^ y G Ai,|x-i/| = l}n{a x = ±1, Vx G Ai}, (4.28) 

block events. The parameters i] and e can be chosen so that there exists (3 t G (0, oo) for which 
the following holds: 

(1) For all (3 < (3 t there exists an expanded Gibbs state ^ exp G <5p such that p M ex P (£/ ex P) > 3 / 4 ; 

(2) For all (3 > j3 t there exist two distinct contracted Gibbs states ^ contr ' ± G (S/3 such that 
*y a *,±(0 contr ' ± )> 3 /4. 

In particular at (3 = (3 t there exist three distinct Gibbs states; one expanded and two contracted 
with opposite values of the magnetization. The authors conjecture that these are the only shift- 
ergodic Gibbs states at (3 = (3 t . 

Unfortunately, the above system has unbounded interactions and so it is not strictly of the form 
for which Theorem 2.5 applies. Instead we will use the following generalization: 
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Theorem 4.5 Let d > 2 and consider a spin system with translation-invariant finite-range 
interaction potentials {<&A)A& d sucn mat tne torus measure is reflection positive for all even L. 
Let Qi, . . . ,Q r be a collection of good B-block events satisfying the requirements in Theorem 2.5 
and let B be the corresponding bad event. Then for all e > there exists 5 > — depending 
possibly only on d but not on details of the model nor on n or B — such that for all (5 > for 
which pp(B) < 5 the following is true: If fi € <& p is a B-shift ergodic Gibbs state with 

< 00, (4.29) 

A: A<gl d 

oeA 

then we have 

p„(B)€[0,e] (4.30) 

and there exists i £ {1, . . . , r} such that 

P^Qi) > 1 - e. (4.31) 

Proof. The proof is virtually identical to that of Theorem 2.5 with one exception: Since the 
interactions are not bounded, we cannot use Lemma 3.4 directly. Suppose we have a Gibbs 
state \i that obeys (4.29) but violates one of the conditions (4.30-4.31). Let Rm,n{Cn) be as in 
(3.4). Lemma 3.1 still applies and so we have (3.5) for some N. 

Let L = MNB and let T>m be the event that the boundary energy in the box A is less 
than cM d_1 , i.e., 

V M = { \®A\<cM d ~ 1 \. (4.32) 

A: AnAz,^0 J 
AnA c L ^0 

where c is a positive constant. In light of the condition (4.29), the fact that the interaction has a 
finite range, and the Chebyshev bound, it is clear that we can choose c so that p(V c M ) < (4A^ d ) _1 
for all M. Hence, we have 

fi(V M n {R m ,n(Cn) > V2}) > (4.33) 

Next let s and s' be as in the proof of Lemma 3.4 and suppose that both s and s' belong to %. 
Then, by definition, 

\H Al (s) - H Al (s')\ < 2cM d ~ 1 (4.34) 
and, applying the rest of the proof of Lemma 3.4, we thus have 

fi(V M n {Rm,n(C n ) > V2}) < e^ cMd " 1 P 2L ^(© M n {Rm,n(C n ) > V2}) • (4.35) 

Neglecting T>l on the right-hand side and invoking (3.28), we again derive the desired contradic- 
tion once M is sufficiently large. □ 

With Theorem 4.5 in the hand, we can extract the desired conclusion for the magnetostriction 
transition. First, the energy condition is clearly satisfied in any state generated by tempered 
boundary conditions. We then know that, in every such ergodic state fi, only a small number 
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blocks will feature bonds that are neither contracted (and magnetized) nor expanded (and non- 
magnetized): 

P M (S eXP )> /v(S eXP,± ) e [0, e] U [1 - e, 1] and ^(B) < e. (4.36) 

The existence of a phase transition follows by noting that the contracted states have less energy 
than the expanded ones; there is thus a jump in the energy density as the temperature varies. 



5. Appendix 

The goal of this section is to prove Theorem 4.4 which concerns the non-linear vector model with 
interaction (4.19). The technical part of the proof is encapsulated into the following claim: 

Proposition 5.1 There exists a constant Cq > such that for all 5 > and all C > Cq the 
following holds: There exists po > such that for all p > po we have 



and 



sup pp{(g so u g^y) < s (5.1) 

0>O 



lim pdOdis) = and lim pp(G so ) < 8. (5.2) 

/3^oo 010 



To prove this proposition, we will need to carry out a sequence of energy and entropy bounds. 
To make our energy estimates easier, and uniform in p, we first notice that there are constants < 
a < b such that 

e- to2 < 1 + C 2 OS(g °<e— \ -1<x<1. (5.3) 

The argument commences by splitting the bad event B = (Q so U £?dis)° into two events: The 
event B wo that Ai contains a weakly-ordered bond, and B m [ x = B \ B wo which, as a moment's 
thought reveals, is the event that Ai contains two adjacent bonds one of which is strongly ordered 
and the other disordered. The principal chessboard estimate yields the following lemma: 

Lemma 5.2 Suppose that C < ^fp. Then 

P^wo) < 4 (min{f *-m-^l*--*'*\ _^ }) % (5.4) 

and 

P 0(B mix ) < 4(ndn{e-^- Voa - 1 --° a ] > ^ {^J U }) % (5-5) 
for all (3 > and all k G (0, 1). Moreover, we have 

p0(Gdis) < vrCVP exp{-2/3[e-^ - e"^ 2 ]} (5.6) 

and 

1 e 2 ^ 

p0(Qso) < (5.7) 
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Proof. Let Zl be the partition function obtained by integrating e /3Hl over all allowed configu- 
rations. Consider the following reduced partition functions: 

(1) Zf & , obtained by integrating e _/3//L subject to the restriction that every bond in is dis- 
ordered. 

(2) Z™, obtained similarly while stipulating that every bond in is strongly ordered. 

(3) ZY°, in which every bond in Tj, is asked to be weakly ordered. 

(4) Z™ x , enforcing that every other horizontal line contains only strongly-ordered bonds, and 
the remaining lines contain only disordered bonds. A similar periodic pattern is imposed on 
vertical lines as well. 

To prove the lemma, we will need upper and lower bounds on the partition functions in (1-2), and 
upper bounds on the partition functions in (3-4). 

We begin by upper and lower bounds on Zf^. First, using the fact that the Hamiltonian is 
always non-positive, we have q^I 3H l > 1. On the other hand, the inequalities (5.3) and a natural 
monotonicity of the interaction imply that 

^ 1 + cos(<^-<^) \p < ^ 1 + casjC/y/p) y < e _ aC 2 (5 g) 

whenever (x,y) is a disordered bond. In particular, —(3Hl is less than 2/3e~ aC2 \Tl\ for every 
configuration contributing to Zff. Using these observations we now easily derive that 

(2tt)I^I < Zf < (2tt)I^I e 2 ^ e " aC2 | T ^. (5.9) 
Similarly, for the partition function Z % £ we get 



e 



<Zr<2ne 2 ^(-^-y Ll . (5.10) 



Indeed, for the upper bound we first note that —(3Hl < 2/3\Tl\. Then we fix a tree spanning all 
vertices of T^, disregard the constraints everywhere except on the edges in the tree and, starting 
from the "leaves," we sequentially integrate all site variables. (Thus, each site is effectively forced 
into an interval of length except for the "root" which retains all of its 2ir possibilities.) For 
the lower bound we fix a number k G (0,1) and restrict the integrals to configurations such that 
\4>x - 4> y \ < for all bonds (x,y) in T L . The bound -pH L > 2(3e~ hK ' 2 / c2 \T L \ then permits 
us to estimate away the Boltzmann factor for all configurations; the entropy factor reflects the 
fact that each site can vary throughout an interval of length at least ^yp- 

Next we will derive good upper bounds on the remaining two partition functions. First, similar 
estimates as those leading to the upper bound in (5. 10) give us 

zrWe^-'^Y 1 ''. (5.11) 

For the partition function Z™ 1X we note that V4 of all sites are adjacent only to disordered bonds, 
while the remaining % are connected to one another via a grid of strongly-ordered bonds. Esti- 
mating —(3Hl < (3(1 + e~ aC )\Tl\ for all relevant configurations, similar calculations as those 
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leading to (5.10) again give us 

ZT < 2^^- aC2 ^\ (2vr)^ (^ = ) 5|Ti|_1 . (5.12) 

It now remains to combine these estimates into the bounds on the quantities on the left-hand side 
of (5.4-5.5) and (5.6-5.7). 

We begin with the bound (5.6). Clearly, p/j^dis) is the L -► oo limit of {Zf* /Zl) 1 ^^, which 
using the lower bound Zj, > Z S L ° with re = 1 easily implies (5.6). The bound (5.7) is obtained 
similarly, except that now we use that Zl > Z^ 1S . The remaining two bounds will conveniently 
use the fact that for two-dimensional nearest-neighbor models, and square tori, the torus mea- 
sure P_f,,/3 is reflection positive even with respect to the diagonal planes in T^. Indeed, focusing 
on (5.4) for a moment, we first note that B wo is covered by the union of four (non-disjoint) events 
characterized by the position of the weakly-ordered bond on Ai. If B^} is the event that the 
lower horizontal bond is the culprit, the subadditivity property of p@ — see Lemma 6.3 of [5] — 

gives us pp(B wo ) < 4pp(Bwo )• Disseminating 6^' using reflections in coordinate directions, we 
obtain an event enforcing weakly-ordered bonds on every other horizontal line. Next we apply a 
reflection in a diagonal line of even parity to make this into an even parity grid. From the per- 
spective of reflections in odd-parity diagonal lines — i.e., those not passing through the vertices of 
the grid — half of the "cells" enforces all four bonds therein to be weakly ordered, while the other 
half does nothing. Applying chessboard estimates for these diagonal reflections, we get rid of the 
latter cells. The result of all these operations is the bound 

. ywo . 1 

Pp(B wo )< lim 4(-M 4|Til . (5.13) 

Estimating Zl from below by the left-hand sides of (5.9-5.10) now directly implies (5.4). 

The event B m i x is handled similarly: First we fix a position of the ordered-disordered pair 
of bonds and use subadditivity of p^ to enforce the same choice at every lattice plaquette; this 
leaves us with four overall choices. Next we use diagonal reflections to produce the event underly- 
ing Z™ 1X . Estimating Zl from below by V4-th power of the lower bound in (5.9) and 3 /4-th power 
of the lower bound in (5.10) with re = 1, we get the first term in the minimum in (5.5). To get the 
second term, we use that Z L > Zf\ apply (5.12) and invoke the bound 1 + e' aC < 2. □ 

Proof of Proposition 5.1. The desired properties are simple consequences of the bounds in 
Lemma 5.2. Indeed, if C is so large that e~ b / c2 > e~ aC * 2 , then (5.6) implies that pp(Qdh) — ► 
as (5 — ► oo. On the other hand, (5.7) shows that the (3 — > limit of ppiGso) is order l /^p, which 
can be made as small as desired by choosing p sufficiently large. 

To prove also (5.1), we first invoke Lemma 6.3 of [5] one last time to see that pp{B) < 
Pp(Bwo) +P/3 (S m ix). We thus have to show that both p / 3(B v/0 ) and p j a(^ m i x ) can be made arbitrary 
small by increasing p appropriately. We begin with p / 3(0 m i x ). Let C be so large that 

\e- b ' c2 - 1 - e~ aC2 > 0. (5.14) 

Then for such that e 2/3 > p 1/A the first term in the minimum in (5.6) decays like a negative 
power of p, while for the complementary values of (3, the second term is 0(p~ /s ). As to the 
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remaining term, pp(B w0 ), here we choose k G (0, 1) such that 

e - k2 / c2 -e^ 2 >0, (5.15) 

and apply the first part of the minimum in (5.4) for (3 with e 2lS > yjp, and the second part for the 
complementary (3, to show that p/3(B W0 ) is also bounded by constants time a negative power of p, 
independently of (3. Choosing p large, (5.1) follows. □ 

Now we can finally prove Theorem 4.4: 

Proof of Theorem 4.4. We will plug the claims of Proposition 5.1 in our main theorem. First, 
it is easy to check that the good block events Q so and satisfy the conditions (1) and (2) of 
Theorem 2.5. Then (5.1) and (2.15-2.16) imply that 

either p^Qdk) > 1 - e or p M (^ so ) > 1 - e (5.16) 

for all shift-ergodic Gibbs states /i 6 (3/3 and all [3 E (0, oo). The limits (5.2) and Corollary 2.6 
then imply the existence of the transition temperature (3 t and of the corresponding coexisting 
states. Since the energy density with negative sign undergoes a jump at /3 t from values e~ h l c 
to values ;$ e~ aC — which differ by almost one once C S> 1 — all ergodic states for (3 > (3 t must 
have small energy density while the states for (3 < (3 t will have quite a lot of energy. Applying 
(5.16), all ergodic fi 6 (3/3 for (3 > (3 t must be dominated by strongly-ordered bonds, while those 
for (3 < (3 t must be dominated by disordered bonds. □ 
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